RAW SEQUENCE LISTING
ERROR REPORT
SYSTEMS
The Biotechnology Systems Branch of the Scientific and Technical Information
Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number: 09/619,047

Source: / & ^ ^ 

Date Processed by STIC: S~ 3 ~ O I 



THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY.

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTIN 2.1 e-mail help: patin21help@uspto.gov or phone 703-306-4119 (R. Wax)
PATENTIN 3.0 e-mail help: patin3help@uspto.gov or phone 703-306-4119 (R. Wax)

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.0 PROGRAM , ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW: 



Checker Version 3.0 

The Checker Version 3.0 application is a state-of-the-art Windows based software program
employing a logical and intuitive user-interface to check whether a sequence listing is in
compliance with format and content rules. Checker Version 3.0 works for sequence listings
generated for the original version of 37 CFR §§ 1.821 - 1.825 effective October 1, 1990 (old
rules) and the revised version (new rules) effective July 1, 1998 as well as World Intellectual
Property Organization (WIPO) Standard ST.25.

Checker Version 3.0 replaces the previous DOS-based version of Checker, and is Y2K-compliant.
Checker allows public users to check sequence listings in Computer Readable Form
(CRF) before submitting them to the United States Patent and Trademark Office (USPTO).
Use of Checker prior to filing the sequence listing is expected to result in fewer errored sequence
listings, thus saving time and money.



Checker Version 3.0 can be downloaded from the USPTO website at the following address:

http://www.uspto.gov/web/offices/pac/checker



Sequence Listing Error Summary



ERROR DETECTED SUGGESTED CORRECTION 



SERIAL NUMBER: 09/619,047



ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE

1 Wrapped Nucleics

The number/text at the end of each line "wrapped" down to the next line.
This may occur if your file was retrieved in a word processor after creating it.
Please adjust your right margin to .3, as this will prevent "wrapping".



Wrapped Aminos

The amino acid number/text at the end of each line "wrapped" down to the next line.
This may occur if your file was retrieved in a word processor after creating it.
Please adjust your right margin to .3, as this will prevent "wrapping".



Incorrect Line Length

The rules require that a line not exceed 72 characters in length. This includes spaces.
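
The 72-character limit can be checked before submission. The following is a minimal sketch only, not part of the Checker program; the function name find_long_lines is an assumption made for illustration.

```python
MAX_LINE_LENGTH = 72  # limit stated in this report; spaces are counted


def find_long_lines(text, limit=MAX_LINE_LENGTH):
    """Return (line_number, length) pairs for every line longer than `limit`."""
    problems = []
    for number, line in enumerate(text.splitlines(), start=1):
        if len(line) > limit:
            problems.append((number, len(line)))
    return problems
```

Any line reported by this sketch would need to be shortened or re-wrapped by hand so that numbering stays on the same line as its sequence data.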



Misaligned Amino Acid 
Numbering 



The numbering under each 5th amino acid is misaligned. This may be caused by the use of tabs 
between the numbering. It is recommended to delete any tabs and use spacing between the numbers. 



Non-ASCII 



This file was not saved in ASCII (DOS) text, as required by the Sequence Rules. 

Please ensure your subsequent submission is saved in ASCII text so that it can be processed. 
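
One way to confirm that a file contains only ASCII text is to scan its bytes. This is a hedged sketch, not the test applied by STIC; the function name find_non_ascii is hypothetical.

```python
def find_non_ascii(data: bytes):
    """Return (line_number, column, byte_value) for each byte outside 7-bit ASCII."""
    hits = []
    for line_no, line in enumerate(data.split(b"\n"), start=1):
        # Iterating over a bytes object yields integer byte values.
        for col, value in enumerate(line, start=1):
            if value > 0x7F:
                hits.append((line_no, col, value))
    return hits
```

An empty result suggests the file is plain ASCII; word-processor files typically produce many hits.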



Variable Length

Sequence(s) contain n's or Xaa's which represent more than one residue.
As per the rules, each n or Xaa can only represent a single residue.
Please present the maximum number of each residue having variable length and
indicate in the (ix) feature section that some may be missing.



PatentIn ver. 2.0 "bug"

A "bug" in PatentIn version 2.0 has caused the <220>-<223> section to be missing from amino acid
sequence(s). Normally, PatentIn would automatically generate this section from the
previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section
to the subsequent amino acid sequence. This applies primarily to the mandatory <220>-<223>
sections for Artificial or Unknown sequences.



Skipped Sequences 
(OLD RULES) 



Sequence(s) missing. If intentional, please use the following format for each skipped sequence: 

(2) INFORMATION FOR SEQ ID NO:X: 

(i) SEQUENCE CHARACTERISTICS: (Do not insert any headings under "SEQUENCE CHARACTERISTICS")
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:X:
This sequence is intentionally skipped

Please also adjust the "(iii) NUMBER OF SEQUENCES:" response to include the skipped sequence(s).



Skipped Sequences 
(NEW RULES) 



Sequence(s) missing. If intentional, please use the following format for each skipped sequence. 

<210> sequence id number 
<400> sequence id number 
000 
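
The new-rules placeholder above can be generated mechanically. A minimal sketch, assuming the sequence id number is a plain integer; the function name skipped_sequence_entry is hypothetical.

```python
def skipped_sequence_entry(seq_id: int) -> str:
    """Build the three-line placeholder for an intentionally skipped sequence."""
    # <210> and <400> both carry the sequence id number; "000" stands in
    # for the residues, following the format shown in this report.
    return f"<210> {seq_id}\n<400> {seq_id}\n000\n"
```

For example, skipped_sequence_entry(7) yields a <210> line, a <400> line, and a line containing 000.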



10 Use of n's or Xaa's
(NEW RULES)

Use of n's and/or Xaa's has been detected in the Sequence Listing.
Use of <220> to <223> is MANDATORY if n's or Xaa's are present.
In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents.



11 Use of "Artificial"
(NEW RULES)

Use of "Artificial" only as "<213> Organism" response is incomplete, per 1.823(b) of New Sequence Rules.
Valid response is Artificial Sequence.



12 Use of <220> Feature
(NEW RULES)

Sequence(s) are missing the <220> Feature and associated headings.
Use of <220> to <223> is MANDATORY if <213> ORGANISM is "Artificial Sequence" or "Unknown".
Please explain source of genetic material in <220> to <223> section.
(See "Federal Register," 6/01/98, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of new Rules)
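
The <213> and <220> to <223> requirements stated above can be sketched as a simple check. This is an illustration under stated assumptions, not the Checker implementation: it assumes each record starts with a "<210>" line and that identifiers are followed directly by their responses (for example "<213> Artificial Sequence"), and the helper names split_records, field and check_organism_rules are hypothetical.

```python
def split_records(listing):
    """Group the non-blank lines of a listing into records starting at <210>."""
    records, current = [], None
    for line in listing.splitlines():
        line = line.strip()
        if line.startswith("<210>"):
            current = [line]
            records.append(current)
        elif current is not None and line:
            current.append(line)
    return records


def field(record, tag):
    """Return the text after the first line starting with `tag`, or None."""
    for line in record:
        if line.startswith(tag):
            return line[len(tag):].strip()
    return None


def check_organism_rules(listing):
    """Return (seq_id, message) pairs for the <213>/<220> problems described above."""
    problems = []
    for record in split_records(listing):
        seq_id = field(record, "<210>")
        organism = (field(record, "<213>") or "").lower()
        if organism == "artificial":
            problems.append((seq_id, "incomplete <213> response"))
        if organism in ("artificial", "artificial sequence", "unknown"):
            # <220> must be present and <223> must carry an explanation.
            if field(record, "<220>") is None or not field(record, "<223>"):
                problems.append((seq_id, "missing <220> to <223> section"))
    return problems
```

A record with "<213> Artificial" is reported as incomplete even when its <220> to <223> section is present, which matches the situation flagged in this report.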



13 PatentIn ver. 2.0 "bug"

Please do not use "Copy to Disk" function of PatentIn version 2.0. This causes a corrupted
file, resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence listing).
Instead, please use "File Manager" or any other means to copy file to floppy disk.
196 <211> LENGTH: 936
197 <212> TYPE: DNA
198 <213> ORGANISM: Renilla reniformis (mutated sequence)

Ile Glu Pro Val Ala Arg Cys Ile Ile Pro Asp Leu Ile Gly
223 65 70 75 80
225 atg ggc aaa tca ggc aaa tct ggt aat ggt tct tat agg tta ctt gat 288
226 Met Gly Lys Ser Gly Lys Ser Gly Asn Gly Ser Tyr Arg Leu Leu Asp
227 85 90 95
229 cat tac aaa tat ctt act gca tgg ttt gaa ctt ctt aat tta cca aag 336
230 His Tyr Lys Tyr Leu Thr Ala Trp Phe Glu Leu Leu Asn Leu Pro Lys
231 100 105 110
233 aag atc att ttt gtc ggc cat gat tgg ggt gct tgt ttg gca ttt cat 384
234 Lys Ile Ile Phe Val Gly His Asp Trp Gly Ala Cys Leu Ala Phe His
235 115 120 125
237 tat agc tat gag cat caa gat aag atc aaa gca ata gtt cac gct gaa 432
238 Tyr Ser Tyr Glu His Gln Asp Lys Ile Lys Ala Ile Val His Ala Glu
239 130 135 140
241 agt gta gta gat gtg att gaa tca tgg gat gaa tgg cct gat att gaa 480
242 Ser Val Val Asp Val Ile Glu Ser Trp Asp Glu Trp Pro Asp Ile Glu
243 145 150 155 160
245 gaa gat att gcg ttg atc aaa tct gaa gaa gga gaa aaa atg gtt ttg 528
246 Glu Asp Ile Ala Leu Ile Lys Ser Glu Glu Gly Glu Lys Met Val Leu
247 165 170 175
249 gag aat aac ttc ttc gtg gaa acc atg ttg cca tca aaa atc atg aga 576
250 Glu Asn Asn Phe Phe Val Glu Thr Met Leu Pro Ser Lys Ile Met Arg
251 180 185 190
253 aag tta gaa cca gac gaa gtt gac gca tat ctt gaa cca ttc aaa gag 624
254 Lys Leu Glu Pro Asp Glu Val Asp Ala Tyr Leu Glu Pro Phe Lys Glu
255 195 200 205
257 aaa ggt gaa gtt cgt cgt cca aca tta tca tgg cct cgt gaa atc ccg 672
258 Lys Gly Glu Val Arg Arg Pro Thr Leu Ser Trp Pro Arg Glu Ile Pro
259 210 215 220
261 tta gta aaa ggt ggt aaa cct gac gtt gta caa att gtt agg aat tat 720
262 Leu Val Lys Gly Gly Lys Pro Asp Val Val Gln Ile Val Arg Asn Tyr
263 225 230 235 240
265 aat gct tat cta cgt gca agt gat gat tta cca aaa atg ttt att gaa 768
266 Asn Ala Tyr Leu Arg Ala Ser Asp Asp Leu Pro Lys Met Phe Ile Glu
267 245 250 255
269 tcg gat cca gga ttc ttt tcc aat gct att gtt gaa ggc gcc aag aag 816
270 Ser Asp Pro Gly Phe Phe Ser Asn Ala Ile Val Glu Gly Ala Lys Lys
271 260 265 270
273 ttt cct aat act gaa ttt gtc aaa gta aaa ggt ctt cat ttt tcg caa 864
274 Phe Pro Asn Thr Glu Phe Val Lys Val Lys Gly Leu His Phe Ser Gln
275 275 280 285
277 gaa gat gca cct gat gaa atg gga aaa tat atc aaa tcg ttc gtt gag 912
278 Glu Asp Ala Pro Asp Glu Met Gly Lys Tyr Ile Lys Ser Phe Val Glu
279 290 295 300
281 cga gtt ctc aaa aat gaa caa taa 936
282 Arg Val Leu Lys Asn Glu Gln
283 305 310
























286 <210> SEQ ID NO: 4
287 <211> LENGTH: 311
288 <212> TYPE: PRT
289 <213> ORGANISM: Renilla reniformis (mutated sequence)
291 <400> SEQUENCE: 4
293 Met Thr Ser Lys Val Tyr Asp Pro Glu Gln Arg Lys Arg Met Ile Thr
294 1 5 10 15
297 Gly Pro Gln Trp Trp Ala Arg Cys Lys Gln Met Asn Val Leu Asp Ser
298 20 25 30
301 Phe Ile Asn Tyr Tyr Asp Ser Glu Lys His Ala Glu Asn Ala Val Ile
302 35 40 45
305 Phe Leu His Gly Asn Ala Ala Ser Ser Tyr Leu Trp Arg His Val Val
306 50 55 60
309 Pro His Ile Glu Pro Val Ala Arg Cys Ile Ile Pro Asp Leu Ile Gly
310 65 70 75 80
313 Met Gly Lys Ser Gly Lys Ser Gly Asn Gly Ser Tyr Arg Leu Leu Asp
314 85 90 95
317 His Tyr Lys Tyr Leu Thr Ala Trp Phe Glu Leu Leu Asn Leu Pro Lys
318 100 105 110
321 Lys Ile Ile Phe Val Gly His Asp Trp Gly Ala Cys Leu Ala Phe His
322 115 120 125
325 Tyr Ser Tyr Glu His Gln Asp Lys Ile Lys Ala Ile Val His Ala Glu
326 130 135 140
329 Ser Val Val Asp Val Ile Glu Ser Trp Asp Glu Trp Pro Asp Ile Glu
330 145 150 155 160
333 Glu Asp Ile Ala Leu Ile Lys Ser Glu Glu Gly Glu Lys Met Val Leu
334 165 170 175
337 Glu Asn Asn Phe Phe Val Glu Thr Met Leu Pro Ser Lys Ile Met Arg
338 180 185 190
341 Lys Leu Glu Pro Asp Glu Val Asp Ala Tyr Leu Glu Pro Phe Lys Glu
342 195 200 205
345 Lys Gly Glu Val Arg Arg Pro Thr Leu Ser Trp Pro Arg Glu Ile Pro
346 210 215 220
349 Leu Val Lys Gly Gly Lys Pro Asp Val Val Gln Ile Val Arg Asn Tyr
350 225 230 235 240
353 Asn Ala Tyr Leu Arg Ala Ser Asp Asp Leu Pro Lys Met Phe Ile Glu
354 245 250 255
357 Ser Asp Pro Gly Phe Phe Ser Asn Ala Ile Val Glu Gly Ala Lys Lys
358 260 265 270
361 Phe Pro Asn Thr Glu Phe Val Lys Val Lys Gly Leu His Phe Ser Gln
362 275 280 285
365 Glu Asp Ala Pro Asp Glu Met Gly Lys Tyr Ile Lys Ser Phe Val Glu
366 290 295 300
369 Arg Val Leu Lys Asn Glu Gln
370 305 310

373 <210> SEQ ID NO: 5

374 <211> LENGTH: 8

C--> 376 <213> ORGANISM: Artificial

378 <220> FEATURE:

379 <223> OTHER INFORMATION: Description of Artificial Sequence: Protease recognition sequence

381 <400> SEQUENCE: 5

383 Ser Gln Asn Tyr Pro Ile Val Gln

384 1 5

The types of errors shown exist throughout the Sequence Listing. Please check
subsequent sequences for similar errors. Sequences 5 - 29




Please Note: 

Use of n and/or Xaa has been detected in the Sequence Listing. Please review the
Sequence Listing to ensure that a corresponding explanation is presented in the <220> to 
<223> fields of each sequence which presents at least one n or Xaa. 
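
The review requested in this note can be sketched in code. The sketch below is an assumption-laden illustration, not the Checker logic: it assumes each record starts with "<210>" followed by the sequence id, that protein records carry "<212> PRT", and that an explanation is any <223> line mentioning the symbol; the function name unexplained_unknowns is hypothetical.

```python
import re


def unexplained_unknowns(listing):
    """Return ids of sequences that use n or Xaa without a <223> line naming it."""
    flagged = []
    # Each record runs from one <210> identifier up to the next one.
    for record in re.findall(r"<210>.*?(?=<210>|\Z)", listing, flags=re.S):
        seq_id = re.match(r"<210>\s*(\S+)", record).group(1)
        header, _, body = record.partition("<400>")
        tokens = re.findall(r"[A-Za-z]+", body)
        if re.search(r"<212>\s*PRT", header):
            symbol = "Xaa"
            uses_unknown = "Xaa" in tokens
        else:
            symbol = "n"
            # Only lowercase tokens are nucleotide groups; "Asn" etc. are skipped.
            uses_unknown = any(t.islower() and "n" in t for t in tokens)
        explained = re.search(r"<223>[^\n]*\b" + symbol + r"\b", header) is not None
        if uses_unknown and not explained:
            flagged.append(seq_id)
    return flagged
```

A sequence is reported only when the symbol appears in the residues and no <223> response mentions it; whether the explanation is adequate still has to be judged by a person.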
L:376 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID#:5
L:389 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID#:6
L:402 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID#:7
L:415 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID#:8
L:428 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID#:9
L:441 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID#:10
L:454 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID#:11
L:467 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID#:12
L:480 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID#:13
L:496 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID#:14
L:509 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID#:15
L:522 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID#:16
L:535 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID#:17
L:548 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID#:18
L:561 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID#:19
L:574 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID#:20
L:587 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID#:21
L:600 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID#:22
L:613 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:22
L:619 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID#:23
L:632 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:23
L:638 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID#:24
L:651 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID#:25
L:664 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID#:26
L:677 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:26
L:683 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID#:27
L:696 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID#:28
L:709 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID#:29
TECH CENTER 1600/2900

<110> Chemicon International, Inc. 
LENG, Jay 

<120> PROTEASE SPECIFIC CLEAVABLE LUCIFERASES AND METHODS OF USE THEREOF

<130> CHEM1110 

<140> US 09/619,047 
<141> 2000-07-18 

<160> 29 

<170> PatentIn version 3.0

<210> 1 

<211> 936 

<213> Renilla reniformis 
<220> 

<221> CDS 

<222> (1)..(936)

<400> 1 

atg act tcg aaa gtt tat gat cca gaa caa agg aaa cgg atg ata act 48
Met Thr Ser Lys Val Tyr Asp Pro Glu Gln Arg Lys Arg Met Ile Thr
1 5 10 15

ggt ccg cag tgg tgg gcc aga tgt aaa caa atg aat gtt ctt gat tca 96
Gly Pro Gln Trp Trp Ala Arg Cys Lys Gln Met Asn Val Leu Asp Ser
20 25 30

ttt att aat tat tat gat tca gaa aaa cat gca gaa aat gct gtt att 144
Phe Ile Asn Tyr Tyr Asp Ser Glu Lys His Ala Glu Asn Ala Val Ile
35 40 45

ttt tta cat ggt aac gcg gcc tct tct tat tta tgg cga cat gtt gtg 192
Phe Leu His Gly Asn Ala Ala Ser Ser Tyr Leu Trp Arg His Val Val
50 55 60

cca cat att gag cca gta gcg cgg tgt att ata cca gat ctt att ggt 240
Pro His Ile Glu Pro Val Ala Arg Cys Ile Ile Pro Asp Leu Ile Gly
65 70 75 80

atg ggc aaa tca ggc aaa tct ggt aat ggt tct tat agg tta ctt gat 288
Met Gly Lys Ser Gly Lys Ser Gly Asn Gly Ser Tyr Arg Leu Leu Asp
85 90 95

cat tac aaa tat ctt act gca tgg ttt gaa ctt ctt aat tta cca aag 336
His Tyr Lys Tyr Leu Thr Ala Trp Phe Glu Leu Leu Asn Leu Pro Lys
100 105 110

aag atc att ttt gtc ggc cat gat tgg ggt gct tgt ttg gca ttt cat 384
Lys Ile Ile Phe Val Gly His Asp Trp Gly Ala Cys Leu Ala Phe His
115 120 125

tat agc tat gag cat caa gat aag atc aaa gca ata gtt cac gct gaa 432
Tyr Ser Tyr Glu His Gln Asp Lys Ile Lys Ala Ile Val His Ala Glu
130 135 140

agt gta gta gat gtg att gaa tca tgg gat gaa tgg cct gat att gaa 480
Ser Val Val Asp Val Ile Glu Ser Trp Asp Glu Trp Pro Asp Ile Glu
145 150 155 160

gaa gat att gcg ttg atc aaa tct gaa gaa gga gaa aaa atg gtt ttg 528
Glu Asp Ile Ala Leu Ile Lys Ser Glu Glu Gly Glu Lys Met Val Leu
165 170 175

gag aat aac ttc ttc gtg gaa acc atg ttg cca tca aaa atc atg aga 576
Glu Asn Asn Phe Phe Val Glu Thr Met Leu Pro Ser Lys Ile Met Arg
180 185 190

aag tta gaa cca gaa gaa ttt gca gca tat ctt gaa cca ttc aaa gag 624
Lys Leu Glu Pro Glu Glu Phe Ala Ala Tyr Leu Glu Pro Phe Lys Glu
195 200 205

aaa ggt gaa gtt cgt cgt cca aca tta tca tgg cct cgt gaa atc ccg 672
Lys Gly Glu Val Arg Arg Pro Thr Leu Ser Trp Pro Arg Glu Ile Pro
210 215 220

tta gta aaa ggt ggt aaa cct gac gtt gta caa att gtt agg aat tat 720
Leu Val Lys Gly Gly Lys Pro Asp Val Val Gln Ile Val Arg Asn Tyr
225 230 235 240

aat gct tat cta cgt gca agt gat gat tta cca aaa atg ttt att gaa 768
Asn Ala Tyr Leu Arg Ala Ser Asp Asp Leu Pro Lys Met Phe Ile Glu
245 250 255

tcg gat cca gga ttc ttt tcc aat gct att gtt gaa ggc gcc aag aag 816
Ser Asp Pro Gly Phe Phe Ser Asn Ala Ile Val Glu Gly Ala Lys Lys
260 265 270

ttt cct aat act gaa ttt gtc aaa gta aaa ggt ctt cat ttt tcg caa 864
Phe Pro Asn Thr Glu Phe Val Lys Val Lys Gly Leu His Phe Ser Gln
275 280 285

gaa gat gca cct gat gaa atg gga aaa tat atc aaa tcg ttc gtt gag 912
Glu Asp Ala Pro Asp Glu Met Gly Lys Tyr Ile Lys Ser Phe Val Glu
290 295 300

cga gtt ctc aaa aat gaa caa taa 936
Arg Val Leu Lys Asn Glu Gln
305 310



<210> 2 
<211> 311 
<212> PRT 

<213> Renilla reniformis 
<400> 2 

Met Thr Ser Lys Val Tyr Asp Pro Glu Gln Arg Lys Arg Met Ile Thr
1 5 10 15

Gly Pro Gln Trp Trp Ala Arg Cys Lys Gln Met Asn Val Leu Asp Ser
20 25 30

Phe Ile Asn Tyr Tyr Asp Ser Glu Lys His Ala Glu Asn Ala Val Ile
35 40 45

Phe Leu His Gly Asn Ala Ala Ser Ser Tyr Leu Trp Arg His Val Val
50 55 60

Pro His Ile Glu Pro Val Ala Arg Cys Ile Ile Pro Asp Leu Ile Gly
65 70 75 80

Met Gly Lys Ser Gly Lys Ser Gly Asn Gly Ser Tyr Arg Leu Leu Asp
85 90 95

His Tyr Lys Tyr Leu Thr Ala Trp Phe Glu Leu Leu Asn Leu Pro Lys
100 105 110

Lys Ile Ile Phe Val Gly His Asp Trp Gly Ala Cys Leu Ala Phe His
115 120 125

Tyr Ser Tyr Glu His Gln Asp Lys Ile Lys Ala Ile Val His Ala Glu
130 135 140

Ser Val Val Asp Val Ile Glu Ser Trp Asp Glu Trp Pro Asp Ile Glu
145 150 155 160

Glu Asp Ile Ala Leu Ile Lys Ser Glu Glu Gly Glu Lys Met Val Leu
165 170 175

Glu Asn Asn Phe Phe Val Glu Thr Met Leu Pro Ser Lys Ile Met Arg
180 185 190

Lys Leu Glu Pro Glu Glu Phe Ala Ala Tyr Leu Glu Pro Phe Lys Glu
195 200 205

Lys Gly Glu Val Arg Arg Pro Thr Leu Ser Trp Pro Arg Glu Ile Pro
210 215 220

Leu Val Lys Gly Gly Lys Pro Asp Val Val Gln Ile Val Arg Asn Tyr
225 230 235 240

Asn Ala Tyr Leu Arg Ala Ser Asp Asp Leu Pro Lys Met Phe Ile Glu
245 250 255

Ser Asp Pro Gly Phe Phe Ser Asn Ala Ile Val Glu Gly Ala Lys Lys
260 265 270

Phe Pro Asn Thr Glu Phe Val Lys Val Lys Gly Leu His Phe Ser Gln
275 280 285

Glu Asp Ala Pro Asp Glu Met Gly Lys Tyr Ile Lys Ser Phe Val Glu
290 295 300

Arg Val Leu Lys Asn Glu Gln
305 310



<210> 3 

<211> 936 

<212> DNA 

<213> Renilla reniformis (mutated sequence) 
<220> 

<221> CDS 

<222> (1)..(936)

<400> 3 

atg act tcg aaa gtt tat gat cca gaa caa agg aaa cgg atg ata act 48
Met Thr Ser Lys Val Tyr Asp Pro Glu Gln Arg Lys Arg Met Ile Thr
1 5 10 15

ggt ccg cag tgg tgg gcc aga tgt aaa caa atg aat gtt ctt gat tca 96
Gly Pro Gln Trp Trp Ala Arg Cys Lys Gln Met Asn Val Leu Asp Ser
20 25 30

ttt att aat tat tat gat tca gaa aaa cat gca gaa aat gct gtt att 144
Phe Ile Asn Tyr Tyr Asp Ser Glu Lys His Ala Glu Asn Ala Val Ile
35 40 45

ttt tta cat ggt aac gcg gcc tct tct tat tta tgg cga cat gtt gtg 192
Phe Leu His Gly Asn Ala Ala Ser Ser Tyr Leu Trp Arg His Val Val
50 55 60

cca cat att gag cca gta gcg cgg tgt att ata cca gat ctt att ggt 240
Pro His Ile Glu Pro Val Ala Arg Cys Ile Ile Pro Asp Leu Ile Gly
65 70 75 80

atg ggc aaa tca ggc aaa tct ggt aat ggt tct tat agg tta ctt gat 288
Met Gly Lys Ser Gly Lys Ser Gly Asn Gly Ser Tyr Arg Leu Leu Asp
85 90 95

cat tac aaa tat ctt act gca tgg ttt gaa ctt ctt aat tta cca aag 336
His Tyr Lys Tyr Leu Thr Ala Trp Phe Glu Leu Leu Asn Leu Pro Lys
100 105 110

aag atc att ttt gtc ggc cat gat tgg ggt gct tgt ttg gca ttt cat 384
Lys Ile Ile Phe Val Gly His Asp Trp Gly Ala Cys Leu Ala Phe His
115 120 125

tat agc tat gag cat caa gat aag atc aaa gca ata gtt cac gct gaa 432
Tyr Ser Tyr Glu His Gln Asp Lys Ile Lys Ala Ile Val His Ala Glu
130 135 140

agt gta gta gat gtg att gaa tca tgg gat gaa tgg cct gat att gaa 480
Ser Val Val Asp Val Ile Glu Ser Trp Asp Glu Trp Pro Asp Ile Glu
145 150 155 160

gaa gat att gcg ttg atc aaa tct gaa gaa gga gaa aaa atg gtt ttg 528
Glu Asp Ile Ala Leu Ile Lys Ser Glu Glu Gly Glu Lys Met Val Leu
165 170 175

gag aat aac ttc ttc gtg gaa acc atg ttg cca tca aaa atc atg aga 576
Glu Asn Asn Phe Phe Val Glu Thr Met Leu Pro Ser Lys Ile Met Arg
180 185 190

aag tta gaa cca gac gaa gtt gac gca tat ctt gaa cca ttc aaa gag 624
Lys Leu Glu Pro Asp Glu Val Asp Ala Tyr Leu Glu Pro Phe Lys Glu
195 200 205

aaa ggt gaa gtt cgt cgt cca aca tta tca tgg cct cgt gaa atc ccg 672
Lys Gly Glu Val Arg Arg Pro Thr Leu Ser Trp Pro Arg Glu Ile Pro
210 215 220

tta gta aaa ggt ggt aaa cct gac gtt gta caa att gtt agg aat tat 720
Leu Val Lys Gly Gly Lys Pro Asp Val Val Gln Ile Val Arg Asn Tyr
225 230 235 240

aat gct tat cta cgt gca agt gat gat tta cca aaa atg ttt att gaa 768
Asn Ala Tyr Leu Arg Ala Ser Asp Asp Leu Pro Lys Met Phe Ile Glu
245 250 255

tcg gat cca gga ttc ttt tcc aat gct att gtt gaa ggc gcc aag aag 816
Ser Asp Pro Gly Phe Phe Ser Asn Ala Ile Val Glu Gly Ala Lys Lys
260 265 270

ttt cct aat act gaa ttt gtc aaa gta aaa ggt ctt cat ttt tcg caa 864
Phe Pro Asn Thr Glu Phe Val Lys Val Lys Gly Leu His Phe Ser Gln
275 280 285

gaa gat gca cct gat gaa atg gga aaa tat atc aaa tcg ttc gtt gag 912
Glu Asp Ala Pro Asp Glu Met Gly Lys Tyr Ile Lys Ser Phe Val Glu
290 295 300

cga gtt ctc aaa aat gaa caa taa 936
Arg Val Leu Lys Asn Glu Gln
305 310



<210> 4 
<211> 311 
<212> PRT 

<213> Renilla reniformis (mutated sequence) 
<400> 4 

Met Thr Ser Lys Val Tyr Asp Pro Glu Gln Arg Lys Arg Met Ile Thr
1 5 10 15

Gly Pro Gln Trp Trp Ala Arg Cys Lys Gln Met Asn Val Leu Asp Ser
20 25 30

Phe Ile Asn Tyr Tyr Asp Ser Glu Lys His Ala Glu Asn Ala Val Ile
35 40 45

Phe Leu His Gly Asn Ala Ala Ser Ser Tyr Leu Trp Arg His Val Val
50 55 60

Pro His Ile Glu Pro Val Ala Arg Cys Ile Ile Pro Asp Leu Ile Gly
65 70 75 80

Met Gly Lys Ser Gly Lys Ser Gly Asn Gly Ser Tyr Arg Leu Leu Asp
85 90 95

His Tyr Lys Tyr Leu Thr Ala Trp Phe Glu Leu Leu Asn Leu Pro Lys
100 105 110

Lys Ile Ile Phe Val Gly His Asp Trp Gly Ala Cys Leu Ala Phe His
115 120 125

Tyr Ser Tyr Glu His Gln Asp Lys Ile Lys Ala Ile Val His Ala Glu
130 135 140

Ser Val Val Asp Val Ile Glu Ser Trp Asp Glu Trp Pro Asp Ile Glu
145 150 155 160

Glu Asp Ile Ala Leu Ile Lys Ser Glu Glu Gly Glu Lys Met Val Leu
165 170 175

Glu Asn Asn Phe Phe Val Glu Thr Met Leu Pro Ser Lys Ile Met Arg
180 185 190

Lys Leu Glu Pro Asp Glu Val Asp Ala Tyr Leu Glu Pro Phe Lys Glu
195 200 205

Lys Gly Glu Val Arg Arg Pro Thr Leu Ser Trp Pro Arg Glu Ile Pro
210 215 220

Leu Val Lys Gly Gly Lys Pro Asp Val Val Gln Ile Val Arg Asn Tyr
225 230 235 240

Asn Ala Tyr Leu Arg Ala Ser Asp Asp Leu Pro Lys Met Phe Ile Glu
245 250 255

Ser Asp Pro Gly Phe Phe Ser Asn Ala Ile Val Glu Gly Ala Lys Lys
260 265 270

Phe Pro Asn Thr Glu Phe Val Lys Val Lys Gly Leu His Phe Ser Gln
275 280 285

Glu Asp Ala Pro Asp Glu Met Gly Lys Tyr Ile Lys Ser Phe Val Glu
290 295 300

Arg Val Leu Lys Asn Glu Gln
305 310



<210> 5 

<211> 8 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Description of Artificial sequence: Protease recognition sequence 

<400> 5 

Ser Gln Asn Tyr Pro Ile Val Gln
1 5 

<210> 6 

<211> 10 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Description of Artificial sequence: Protease recognition sequence 

<400> 6 

Lys Ala Arg Val Leu Ala Glu Ala Met Ser 
1 5 10

<210> 7 

<211> 10 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Description of Artificial sequence: Protease recognition sequence 

<400> 7

Pro Ser Pro Arg Glu Gly Lys Arg Ser Tyr
1 5 10

<210> 8 

<211> 5 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Description of Artificial sequence: Protease recognition sequence 

<400> 8 

Tyr Val Ala Asp Gly 
1 5 

<210> 9 

<211> 8 

<212> PRT 
<213> Artificial sequence
<220> 

<223> Description of Artificial sequence: Protease recognition sequence 

<400> 9 

Met Phe Gly Gly Ala Lys Lys Arg 
1 5 

<210> 10 

<211> 10 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Description of Artificial sequence: Protease recognition sequence 

<400> 10 

Gly Val Val Asn Ala Ser Ser Arg Leu Ala 
1 5 10

<210> 11 

<211> 9 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Description of Artificial sequence: Protease recognition sequence 

<400> 11 

Leu Ile Ala Tyr Leu Lys Lys Ala Thr
1 5 

<210> 12 

<211> 7 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Description of Artificial sequence: Protease recognition sequence 

<400> 12 

Val Lys Met Asp Ala Glu Phe 
1 5 

<210> 13 

<211> 17 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Description of Artificial sequence: Protease recognition sequence 

<400> 13 



Phe Leu Ala Glu Gly Gly Gly Val Arg Gly Pro Arg Val Val Glu Arg
1 5 10 15

His



<210> 14

<211> 13

<212> PRT

<213> Artificial sequence
<220>

<223> Description of Artificial sequence: Protease recognition sequence

<400> 14

Asp Arg Val Tyr Ile His Pro Phe His Leu Val Ile His
1 5 10

<210> 15

<211> 8

<212> PRT

<213> Artificial sequence
<220>

<223> Description of Artificial sequence: Protease recognition sequence

<400> 15

Lys Pro Ala Leu Phe Phe Arg Leu
1 5

<210> 16 

<211> 4 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Description of Artificial sequence: Protease recognition sequence 

<400> 16 

Ile Glu Pro Asp
1 

<210> 17 

<211> 4 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Description of Artificial sequence: Protease recognition sequence 

<400> 17 

Asp Glu Thr Asp 
1 

<210> 18 

<211> 4 

<212> PRT 

<213> Artificial sequence 



<220>

<223> Description of Artificial sequence: Protease recognition sequence

<400> 18

Trp Glu His Asp
1

<210> 19

<211> 4

<212> PRT

<213> Artificial sequence
<220>

<223> Description of Artificial sequence: Protease recognition sequence

<400> 19

Tyr Val Ala Asp
1

<210> 20

<211> 4

<212> PRT

<213> Artificial sequence
<220>

<223> Description of Artificial sequence: Protease recognition sequence

<400> 20

Asp Glu His Asp
1

<210> 21

<211> 4

<212> PRT

<213> Artificial sequence
<220>

<223> Description of Artificial sequence: Protease recognition sequence

<400> 21

Asp Glu Val Asp
1

<210> 22

<211> 4

<212> PRT

<213> Artificial sequence
<220>

<223> Description of Artificial sequence: Protease recognition sequence
<220>

<221> VARIANT

<222> (1)..(1)

<223> Xaa is Trp or Leu

<400> 22

Xaa Glu His Asp
1

<210> 23 

<211> 4 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Description of Artificial sequence: Protease recognition sequence 
<220> 

<221> VARIANT 

<222> (3)..(3)

<223> Xaa is Ile or His



<400> 23 

Val Glu Xaa Asp 
1 

<210> 24 

<211> 4 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Description of Artificial sequence: Protease recognition sequence 

<400> 24 

Leu Glu Thr Asp 
1 

<210> 25 

<211> 4 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Description of Artificial sequence: Protease recognition sequence 

<400> 25 

Leu Glu His Asp 
1 

<210> 26 

<211> 4 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Description of Artificial sequence: Protease recognition sequence 
<220> 

<221> VARIANT 

<222> (1)..(3)
<223> Xaa is any amino acid



<400> 26 

Xaa Xaa Xaa Asp 
1 

<210> 27 

<211> 8 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Description of Artificial sequence: Protease recognition sequence 

<400> 27 

Arg Pro Leu Gly Ile Ile Gly Gly
1 5 

<210> 28 

<211> 3 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Description of Artificial sequence: Protease recognition sequence 

<400> 28 

Glu Gly Arg 
1 

<210> 29 

<211> 3 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Description of Artificial sequence: Protease recognition sequence 

<400> 29 



Val Leu Lys 
1 



